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GOODNESS-OF-FIT  TESTS  FOR  ADDITIVE  HAZARDS  AND 
PROPORTIONAL  HAZARDS  MODELS 


Ian  W.  McKeague 
The  Florida  State  University 

and 

Klaus  3.  Utikal 
University  of  Kentucky 


Abstract 


Goodness-of-fit  tests  for  Cox’s  proportional  hazards  model  and  Aalen’s  additive  risk  model, 
in  which  each  model  is  compared  on  an  equal  footing  with  the  best  fitting  fully  nonparametric 
model,  are  developed.  The  goodness-of-fit  statistics  are  based  on  differences  between  estimates 
of  the  doubly  cumulative  hazard  function  A(t,z)  =  A(s,x)  ds  dx,  under  each  model,  with 

a  fully  nonparametric  estimator  of  A  recently  introduced  by  the  authors.  Here  A(-,r)  denotes  the 
conditional  hazard  function  of  the  survival  time  of  an  individual  with  covariate  vector  Comparison 
of  the  results  of  the  tests  makes  it  possible  to  decide  whether  Cox’s  proportional  hazards  or  Aalen’s 
additive  risk  model  gives  a  better  fit  to  the  data.  In  addition,  a  goodness-of-fit  test  for  Cox’s  model 
within  the  family  of  all  proportional  hazards  models  A(/,z)  =  Ao(Or(-),  where  A0  is  a  baseline 
hazard  function  and  r  is  a  general  relative  risk  function,  is  developed. 


1.  Introduction 


^  Additive  hazards  and  proportional  hazards  regression  models  used  in  the  analysis  of 
censored  survival  data  can  give  substantially  different  results.  For  instance,  in  connection 
with  a  study  of  cancer  mortality  among  Japanese  atomic  bomb  survivors,  Muirhead  and 
Darby  (1987)  have  noted  that  the  two  models  give  substantially  different  estimates  of  the 
age-specific_probabjlity  that.^n  individual  will  develop  radiation  induced  cancer.  Muirhead 
and  Darby'fti'twaJm)  AramlarOidm,  1903) introduced  a  generalized  parametric  model  which 
contains  parametric  additive  hazards  and  proportional  hazards  models  as  special  cases.  The 
goodness-of-fit  of  each  model  is  then  obtained  by  comparing  with  the  best  fitting  model 
within  the  generalized  family,  allowing  the  two  special  models  to  be  treated  on  an  equal 


footings 

./""Beyond  the  parametric  setting,  much  effort  has  been  devoted  to  the  development  of 
goodness-of-fit  tests  for  Cox’s  (1972)  proportional  hazards  model 

\(t,z)  =  A0(t)e^,  (1.1) 


where  A (t,z)  =  A(<|z)  is  the  conditional  hazard  function  of  the  survival  time  of  an  individual 
who  has  a  covariate  vector  z  —  (zj, . . .  ,2P)',  say,  at  time  t,  /Jo  is  a  vector  of  p  parameters  and 
A0  is  an  arbitrary  baseline  hazard  function.  Many  papers  deal  with  graphical  methods  and 
their  interpretation — see  Arjas  (1988)  for  a  recent  contribution.  These  methods  are  useful 
for  detecting  and  diagnosing  possible  departures  from  the  model,  but  their  interpretation  in 
the  absence  of  formal  significance  tests  is  largely  subjective.  A  formal  test  for  Cox’s  model 
was  introduced  by  Schoenfeld  (1980)  who  developed  a  test  of  Cox’s  model  against  the  fully 
nonparametric  alternative  that  A(t,z)  is  arbitrary  by  using  a  chi-squared  test  based  on 
the  observed  and  expected  frequencies  that  data  points  fall  into  cells  that  partition  the 
product  of  the  time  and  covariate  state  spaces.  Using  a  similar  approach,  Andersen  (1982) 
introduced  a  test  of  whether  the  inclusion  of  a  new  covariate  zp+J  gives  rise  to  a  Cox  model 
when  (zj, . . .  ,zp)'  already  does.  Moreau,  O’Quigley  and  Mesbah  (1985)  considered  testing 
whether  0o  varies  with  time. 

Recently,  Aalen  (1988)  discussed  some  graphical  methods  for  examining  the  goodness- 
of-fit  of  the  additive  risk  model  (Aalen,  1980) 


v 

*(*»*)  =  £*;(*)  * 

j=i 


(1.2) 


where  oj,. . .  ,ctp  are  arbitrary  functions  of  time. 

- -  The  purpose  of  4he  /present  paper  is  to  develop  formal  goodness-of-fit  tests  for  the 

models  of  Aalen  and  Cox  in  which  each  model  is  compared  on  an  equal  footing  with  the 
best  fitting  fully  nonparametric  model.  Our  test  statistics  are  based  on  differences  between 
estimates  of  the  doubly  cumulative  hizard  function  A(t,z)  =  J* fQ  X(s,x)dsdx,  under 
each  model,  with  a  nonparametric  estimator  .4  of  A  introduced  by  McKeague  and  Utikal 
(1988).  Comparison  of  the  results  of  our  tests  makes  it  possible  to  decide  whether  Cox’s 
proportional  hazards  or  Aalen ’s  additive  risk  model  gives  a  better  fit  to  the  data. 

Goodness-of-fit  statistics  based  on  a  comparison  of  estimates  of  cumulative  hazard 
functions,  allowing  the  application  of  powerful  counting  process  and  martingale  techniques, 


v'y  *  r 

')  i  -  1 
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have  been  previously  studied  by  Hjort  (1984).  He  constructed  tests  for  the  hypothesis  that 
the  baseline  hazard  function  in  Cox’s  model  follows  a  given  parametric  form,  where  the 
relative  risk  function  r(z)  =  is  assumed  to  be  correctly  specified.  Hjort  obtained  a 
weak  convergence  result  for  the  difference  between  nonparametric  and  parametric  estimates 
of  the  cumulative  baseline  hazard  function,  and  used  his  result  to  construct  a  chi-squared 
statistic  based  on  a  division  of  the  time  domain  into  cells.  We  have  extended  this  approach 
to  the  full  Cox  model. 

The  general  model  (in  which  \(t,  z)  is  fully  nonparametric)  is  described  in  Section  2. 
Our  goodness-of-fit  tests  for  the  models  of  Cox  and  Aalen  are  presented  in  Sections  3  and  5 
respectively.  In  the  case  of  Cox’s  model,  with  p  =  1,  we  compare  A  with  the  semiparametric 
estimator  A  of  A  given  by 

A(t,z)  =  A(<)  /V'rf*,  (t,z)  6  [0,  l]2  (1.3) 

Jo 

where  A  is  an  estimator  of  the  baseline  hazard  function  and  (3  is  Cox’s  maximum  partial 
likelihood  estimator.  Under  Cox’s  model  the  two  estimators  should  be  close  to  one  another. 
We  show  that  y/n{A  —  -4)  converges  weakly  to  a  certain  Gaussian  random  field  which  is 
represented  as  a  sum  of  stochastic  integrals  with  respect  to  a  Brownian  sheet  process. 
This  result  leads  to  the  construction  of  our  goodness-of-fit  test  for  Cox’s  model  against 
the  general  alternative.  In  Section  4  we  develop  a  test  for  Cox’s  model  against  the  more 
restrictive  alternative  of  general  proportional  hazards:  \(t,z)  =  A0(<)r(2)*  where  r  is  an 
unspecified  relative  risk  function. 

Proofs  of  all  our  results  are  collected  in  Section  6. 


2.  The  general  model 

Let  N(t)  =  . . . ,  JVn(t))\  t  G  [0,1],  be  a  multivariate  counting  process  with 

respect  to  a  right-continuous  filtration  (/<),  i.e.  N  is  adapted  to  the  filtration  and  has 
components  Ni  which  are  right-continuous  step  functions,  zero  at  time  zero,  with  jumps  of 
size  +1  such  that  no  two  components  jump  simultaneously.  Here  N,(t)  records  the  number 
of  observed  failures  (0  or  1)  in  [0,1]  for  the  ith  individual.  Suppose  that  A \  has  intensity 
of  the  general  form 

Ai(0  =  Pi(0A(f,Z,(0),  i  =  1 , . . . ,  n 

where  Y,(<)  is  a  predictable  {0,l}-valued  process,  indicating  that  the  tth  individual  is  at 
risk  when  Y{(t)  =  1.  and  Zi(t)  is  a  predictable  covariate  process.  The  function  A (t,z) 
represents  the  failure  rate  for  an  individual  at  risk  at  time  1  with  covariate  Z,(<)  =  z.  We 
assume  throughout  that  (Ni,  Y),Z,),  i  =  l,...,n  are  i.i.d.  replicates  of  (A^,Y,Z)  and  Z  is 
scalar  valued.  Note  that  (see  Andersen  and  Borgan,  1985)  the  processes 

=  Ni(t) -[*¥'(*)  X(s,Zi(s))dst  t=l,...,n  (2.1) 

are  orthogonal  local  square  integrable  martingales  with  predictable  variation  process 

(M,,M,)<=  /  y,(s)A(s,Zj(s))rfs,  i  =  1, . . . ,  n. 

Jo 
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The  goodness-of-fit  tests  introduced  in  this  paper  involve  a  certain  estimator  for 
A(t,z )  =  JqJq  A (s,x)dsdx  over  the  unit  square  [0,  l)2.  Let  xr  =  r/dn  and  IT  -  [arr_i , arr ) 
for  r  =  1, . . . ,  dn,  where  dn  is  an  increasing  sequence  of  positive  integers.  Let  Nir(t )  be  the 
counting  process  which  registers  the  jumps  of  Ni(t)  when  £,(<)  £  Tr,  so  that 


Nir(t)=  f  I(Zi(s)e  Jr)  dNi(s). 
Jo 


Beran  (1981)  suggested  that  the  cumulative  conditional  hazard  function  A (t,z)  = 
f*  A (s,z)ds  could  be  estimated  by  the  Nelson-Aalen  type  estimator 


where 


A(M)=  /  Jy-  dNj.n)(s),  for  z  dir 
Jo  yrv  ( s ) 


Yr in)(s)='£l(Zi(s)€lr)Yi(s) 


and  N(rn)  =  AI,>.  McKeague  and  Utikal  (1988),  subsequently  referred  to  as  MU, 

proposed  the  following  estimator  for  A 


A(t,z)=  I  A(t,x)dx, 
Jo 


and  they  obtained  a  weak  convergence  result  for  A. 

Before  stating  that  result  we  need  to  introduce  some  more  notation  and  some  conditions 
on  Y  and  Z.  Let  f'f*  <f>(s,x)dlV(s,x)  denote  a  continuous  version  of  the  Wiener  integral 
of  a  function  4>  €  X*([0,  l]2,ds  dx)  with  respect  to  a  Brownian  sheet  W,  see  Wong  and  Zakai 
(1974).  Suppose  that  for  each  t  e  [0,1],  the  random  vector  (Zt,Yt)  is  absolutely  continuous 
with  respect  to  the  product  of  Lebesgue  measure  on  [0, 1]  and  counting  measure,  and  denote 
the  corresponding  density  by  fz(t)Y(t){z,y)-  Also,  assume  that  /z(<)  Y(t)(zi  1)  is  a  positive, 
continuous  function  of  (t,  z)  £  [0,  l]2.  Let  Di  denote  the  extension  of  Skorohod  space  D[ 0,  l] 
to  functions  on  [0,  l]2,  as  defined  in  Neuhaus  (1971),  and  let  D[0,  l]p  denote  the  product  of 
p  copies  of  D[ 0, 1]. 

In  the  present  setting  we  may  state  Theorem  3.1  of  MU  as  follows. 

Proposition  3.1.  Suppose  that  A  is  Lipschitz,  d„/n  — >  oc  and  dn  =  o(n#)  for  some 

6  €  (|,1).  Then  y/n{A  —  A)^m  in  Di  as  n  — *  oo,  where  m  =  ( m(t,z ),  (t,z)  £  [0,  l]2 )  is 
given  by 


i(M)  =  /  /  y/h(s,x)dlV(s,x), 
Jo  Jo 


h(s,x)  = 


A  (s,x) 

fz(i)  KfijCl)!) 


In  the  sequel  we  shall  denote  H(t,  z)  =  fgfj  h(u,x)  dx  du  and  denote  the  corresponding 
measure  on  [0,  l]2  by  H  as  well. 


3.  Cox’s  proportional  hazards  model  vs.  the  general  model 


Inference  for  0O  in  (1.1)  can  be  based  on  the  partial  likelihood  function 

(T»)  ^ 


w)=n{ 


>  e, 

o0Z,(Tt)  /  ’ 


(3-1) 


where  6,  and  7*  are  the  indicator  of  noncensorship  and  the  survival  time  for  the  ith  indi¬ 
vidual  respectively,  and  Hi  is  the  risk  set  consisting  of  all  individuals  who  are  observed  to 
be  at  risk  at  time  7).  This  approach  was  proposed  by  Cox  (1972,  1975).  Let  /?  be  the  value 
that  maximizes  L(/3)  and  estimate  the  cumulative  baseline  hazard  Ao(0  =  J0  Ao(s)  ds  by 
the  Bresiow  (1972,  1974)  type  estimator 


a(o  =  E 


Si 


T,<t 


e&ZATi)' 


(3.2) 


We  are  interested  in  testing  the  null  hypothesis  Hq:  Cox’s  proportional  hazards  model 
(1.1)  holds  over  the  region  ( t,z )  £  [0,  l]2.  The  natural  estimator  of  A  under  Ho  is 

A(t,z)  =  A(t)  f  e&xdx,  (t,z)G[0,l]2 

Jo 

where,  if  (Ti,  Zi(Ti))  falls  outside  [0,  l]2,  the  survival  time  T,  is  regarded  as  being  censored 
(i.e.  6i  is  set  to  0).  Introduce  some  notation  (cf.  Andersen  and  Gill,  1982): 

SU)(M  =  -  E Zi(t¥Yi(t)W  <  Z,(t)  <  l)e^K 


sO>(/M)  =  £S(i)(/M), 


for  j  =  0, 1,2,  where  0°  =  1,  and 


e  =  s<17s<°>,  u  =  s(2>/s<0)  -  e2, 

S=  /  v(/3t)s{o)(0o,t)Mt)dt. 

Jo 


Formulated  in  terms  of  the  counting  processes,  the  estimate  $  is  the  unique  solution  to 
•§0  log  L((J)  =  !/(/?,  1)  =  0,  where 

n  rt 


U(P,t)  =  E  l  {Z'W  "  }/(0  ^  Z-(u)  *  1  )dNi{u), 

and  the  estimator  (3.2)  is  given  by 


(3.3) 


M*) 


=  [- 


dN(u) 


o  nSW0,u)' 


4 


where  N  =  ^rn\  and  is  defined  in  Section  2. 


Theorem  3.1.  Suppose  that  Y  and  Z  are  left-continuous  with  right  hand  limits,  E  is 
positive,  Ao  is  Lipschitz,  d\jn  — >  oo  and  d„  =  o( ns)  for  some  6  £  ,  1).  Then,  under  Cox’s 

proportional  hazards  model  (1.1 ),  </n(A  —  A)^m'  in  Di  as  n  — *  oo,  where 


m'(t,  z ) 


h(u,x) 


j:j;  -  •«  jfjf 

AqOQ  e 

fz(v)  K(u)(*. 1)  ’ 


g(u,x)  =  A0(u)e^or/Z(u)y(u)(i,l), 

b(z)  =  [*  e0ozdx, 

Jo 

c(t,z)  =  E_1  ^A0(t)  J  xe0oXdx  -  b(z)  J  e((30 ,  u)  A0(u)  duj . 


In  order  to  test  H0  against  the  alternative  that  A  has  the  genera]  form  of  Section  2  we 
might  consider  using  statistics  of  Kolmogorov-Smirnov  type  or  Cram4r-von  Mises  type: 

\fn  sup  \A(t,z)  -  A(t,z)\  or  y/n  f  f  (A(t,z)  -  A{t,z))2  dtdz 
(<,*)elo,i)2  Jo  Jo 

which  have  asymptotic  distributions  sup{(  ^e[0  tlja  |m'(t,z)|  and  f*  f*(m'(t,z))2  dtdz  re¬ 
spectively.  However,  general  tables  for  these  distributions  are  not  available.  MU  suggested 
that  critical  values  for  such  distributions  be  obtained  by  simulation  of  the  process  m'.  A 
more  feasible  approach  might  be  to  bootstrap  the  estimators  A  and  A  in  some  way  [cf. 
the  papers  of  Akritas  (1986),  Horv&th  and  Yandell  (1987)  and  Lo  and  Singh  (1986)  on 
the  bootstrapped  Kaplan-Meier  estimator],  but  we  shall  not  pursue  that  possibility  here. 
Rather,  our  present  approach  [following  Schoenfeld  (1980)]  is  to  derive  a  chi-squared  test 
based  on  a  partition  of  the  product  of  the  time  and  covariate  state  spaces  into  cells. 

Let  0  =  to  <  •  •  •  <  tfl  =  1  and  0  =  zo  <  •  ■  ■  <  zt  -  1  and  denote  %  -  (tr_i,tT) 
and  Zi  =  (aj-1,2/]  so  that  the  cells  Jrt  —  Tr  x  Zi  partition  [0,  l]2.  The  increment 
of  X  =  y/n(A  -  A)  over  Jrl  is  given  by  =  X(Jrt)  =  X(tr,zi)  -  .Y(<r,a;_i)  - 
X(tr_i,zj)  +  A’(fr-i,zi_i).  Under  Ho  and  the  conditions  of  Theorem  3.1  we  have  that 
Q(ni  =  (Q^\  r  =  1,...,R;  l  =  1  converges  in  distribution  to  the  Gaussian  ran¬ 
dom  array  Q  =  ( Qri,  r  =  1, . . .  ,R;  /  =  1, . . .  ,L)  with  mean  zero  and  covariance 

Cov(Qr/,Qr,,,)  =  H(JrlnJr.,,)  -  b(Z,)b(Zr )  /  -  c(Jrl)c(Jr.,.)X, 

J Tt nT.i 


I 


where  b(Zi )  and  c{JTi)  denote  increments  of  b  and  c.  A  consistent  estimator  for  this 
covariance  can  be  obtained  by  inserting  the  usual  estimates  of  /?&,  Ao,  E  and  e(/?o,-) 
in  the  last  two  terms  above  and  estimating  the  first  term  by  H(Jr{),  where 


n_  W  f 1  dAojs) 

dl  h  Jo  Yrn\s) 


(3.4) 


The  estimator  H  is  similar  to  the  general  estimator  of  H  employed  in  MU  (Section  3). 
Routine  modifications  to  the  proof  of  Lemma  9  of  MU  show  that  H  is  consistent. 

If  we  write  and  Q  in  the  form  of  column  vectors  V and  U,  respectively,  (by 
stacking  columns  one  on  top  of  each  other,  say)  and  let  C ^  denote  the  corresponding 
estimate  of  the  covariance  matrix  C  of  U,  then  our  test  statistic  is  given  by 

f(">  =  [/("). 


Under  Ho  and  the  conditions  of  Theorem  3.1  we  obtain  that  has  a  limiting  x *  distri¬ 
bution,  w’here  q  =  rank(C).  Usually  we  would  expect  that  C  is  of  full  rank,  in  which  case 
q=RL. 


4.  Cox’s  model  vs.  general  proportional  hazards 


The  general  proportional  hazards  model  A (t,z)  =  Ao(t)  r(z),  where  r  is  an  unknown 
relative  risk  function,  was  proposed  by  Thomas  (1983).  This  model  admits  nonlinear  co¬ 
variate  effects  while  preserving  the  proportional  hazards  form.  Tibshirani  (1984),  Hastie 
and  Tibshirani  (1986)  and  O’Sullivan  (1986a,  1986b)  have  studied  various  estimators  for 
the  log  relative  risk  function  log  r(z).  MU  studied  an  estimator  of  the  cumulative  relative 
risk  function  J0r(x)dx  and  developed  a  goodness-of-fit  test  for  the  general  proportional 
hazards  model. 

Now'  consider  testing  the  Cox  model  null  hypothesis  H0  of  Section  3  vs.  the  alterna¬ 
tive  that  the  general  proportional  hazards  model  holds.  Our  test  statistic  compares  two 
estimators  of  the  normalized  cumulative  relative  risk  function 


R(z) 


Jo  r(x)dx 
Jo1  r(z)dx 


Since  r(z)  =  e0oZ  under  Cox’s  model,  the  natural  estimator  of  R  under  Ho  is 


R(z) 


Jo  dx 

Jo  ebx  dx 


An  estimator  for  R  under  the  general  proportional  hazards  model  is 


R(z) 


Ah*) 

•4(1,1)' 
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The  following  result  gives  the  asymptotic  distribution  of  y/n(R  -  R)  under  Hq. 


Theorem  4.1.  Suppose  that  the  conditions  of  Theorem  3.1  hold.  Then,  under  Cox’s 
proportional  hazards  model  (3.1),  y/n(R  —  R)-+pm"  in  Z)[0, 1]  as  n  — ►  oo,  where  p  = 
(Ao(1)&2(1))-\ 


m"(z )  =  6(1) 

p{z)  =  E"1 


/  j  ^ h(u,x )  dW{u,x)  —  b(z )  f  j  \/h(u,x)dW(u,x) 
Aq(1)^6(1)  y  xe0oXdx  -  b(z)  J  xe0oZ  dij. 


We  shall  omit  the  proof  of  this  result  since  it  is  very  similar  to  the  proof  of  Theorem 
3.1.  A  chi-squared  goodness-of-fit  test  can  be  derived  using  Theorem  4.1,  much  as  it  was 
done  using  Theorem  3.1. 

Let  0  =  zo  <  •  •  •  <  zi  =  1  so  that  the  intervals  Zt  =  ( zt~i,zi ]  partition  [0,1]. 
Let  X  =  y/n(R  -  R)  and  let  Q |n)  denote  the  increment  of  X  over  Z\.  Under  Ho  and 
the  conditions  of  Theorem  4.1  we  have  that  Q /  =  1  converges  in 

distribution  to  the  Gaussian  random  vector  Q  =  ( Qt ,  /  =  1  with  mean  zero  and 

covariance 

Co  v{QhQ,.)  =  p2  [b(l)2Hx(Z,  HZ,.)-  6(l)6(2,)#i(2i')  -  b(l)b{Z,.)  HX(Z,) 

+  b(Z, )  b{Zv  )H(  1, 1)  +  3  p(Z,)  ip(Zr )  E], 

where  H\(z)  =  77(1, z).  This  covariance  can  be  estimated  consistently  by  inserting  the 
usual  estimates  of  0o,  Ao  and  E;  and  estimating  H\  by  H i(-)  =  ^(1,-),  where  H  is  given 
by  (3.4). 


5.  Aakm’s  additive  risk  model  vs.  the  general  model 

For  simplicity,  we  shall  only  consider  the  following  special  case  of  Aalen’s  model  (1.2): 

W,z)  =  Q1(1)  +  Q2(t)z,  (5.1) 

where  2  is  scalar  valued,  but  our  approach  could  easily  be  extended  to  the  full  Aalen  model 
(1.2).  Weighted  least  squares  estimators  Aj  for  the  functions  Aj(t)  =  J*  Oj(s)  ds ,  t  €  [/oi  1], 
j  =  1,2  were  introduced  by  Huffer  and  McKeague  (1987)  and  McKeague  (1988a).  Here 
<o  is  fixed,  0  <  to  <  1.  The  reason  for  restricting  estimation  to  the  time  interval  [toil]  is 
that  it  is  not  possible  to  estimate  the  correct  weights  uniformly  over  the  whole  of  [0, 1],  as 
required  for  the  asymptotic  theory  developed  in  McKeague  (1988a).  In  this  section  Z>2  is 
restricted  to  functions  on  [/q,  1]  X  [0, 1], 
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The  null  hypothesis  that  we  intend  to  test  is  H o-  Aalen’s  additive  risk  model  (5.1) 
holds  over  the  region  [to,  l]  x  [0, 1].  Define  A  and  A  as  in  Section  2,  except  with  the  range 
of  integration  for  the  time  variable  going  from  to  to  t.  Since  A(t,z)  =  z  A\(t)  +  ^22  A2(t) 
under  Ho,  a  reasonable  estimator  for  A  under  H0  is 

A(t,z)  =  z  Ai(t)  +  \z2A2(t). 

Let  Yn(t)  =  y<(t)/( 0  <  Zi{t)  <  1)  and  yi2(t)  =y,(t)Zi(t) 7(0  <  Z<(t)  <  1).  Then  the 
intensity  of  the  counting  process  Ni  is  given  by 

A<(t)  =  ai(t)y<i(t)  +  02(0^2(0 

under  the  additive  risk  model.  Using  similar  notation  to  McKeague  (1988a),  the  weighted 
least  squares  estimator  A  is  defined  by 

A{t)=  [*  Y~(s)dN(s), 

Jto 

where  y_(s)  =  (y'(s)  VU(s) y(s))-1y'(s)  lU(s)  and  y(s)  =  (Tij(s))  is  the  n  x  2  matrix  of 
covariate  processes,  W(t)  is  the  n  x  n  diagonal  matrix  with  :th  diagonal  entry  (Aj(<))~1, 

A,(t)  =  d1(<)yil(t)  +  o2(t)yt2(0 

is  an  estimate  of  the  intensity  A ,(<),  and  d  =  (d2,  d2)'  is  the  smoothed  least  squares 
estimator 

*<&)**.* 

Here  A  =  (Aj,  A2)'  is  Aalen’s  least  squares  estimator 

A(t)=  [\Yl(s)Y(s))-iY(s)dN(s), 

Jo 

K  is  a  left-continuous  kernel  function  of  bounded  variation  having  integral  1,  support  [0, 1] 
and  bn  >  0  is  a  bandwidth  parameter.  Let  L(t)  and  V(t)  denote  the  2x2  matrices 
with  entries  Ljk(t)  ~EYij(t)  Y\k(t),  V,*(t)  =EY\j(t)Yik{t)  Af  !(t)  respectively.  Also,  for 
any  square  matrix  D,  let  D~x  denote  the  inverse  of  D  if  D  is  invertible,  the  zero  matrix 
otherwise. 


Theorem  5.1.  Suppose  that  the  processes  Yt  and  Zt  are  left-continuous  with  right  hand 
limits,  and  a2  are  Lipschitz,  the  matrix  functions  L(-)  and  V(  )  are  continuous,  L(t) 
and  V(t)  are  nonsingular  for  all  t  G  [0,1],  inf(<iZ)e[0,i]z  A(t,  z)  >  0,  bn  — * ■  0,  nb\  00, 
d~nj n  -+  00  and  dn  =  o(ns)  for  some  6  G  (|,1).  Then,  under  Aa/en’s  additive  risk  model 

(5.1),  y/n(A  -  A)  ^>m'  in  Di  as  n  —  00,  where 


i(y-'(*))ii +x(v-'(s))u] 

y/h{s,x) 


dW(s,x) 


[(y-^^bi  +  x^-^s))^ 


dlU(s,x), 


h(s,x) 


Ql(s)  -I-  P2(s)  X 
fz(s)  >■(»)(*,!) 
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Define  a  chi-squared  statistic  f  for  testing  Ho  in  the  same  way  that  f  was  defined 
in  Section  2,  but  with  X  =  y/n{A— .4).  Under  Ho  and  the  conditions  of  Theorem  5.1  we  have 
that  T  —  1) l  =  1  converges  in  distribution  to  the  Gaussian 

random  array  Q  =  (Qri,  r  =  1, . . . ,  R;  l  =  1, . . . ,  L)  with  mean  zero  and  covariance 

Cov(Qri,QrH')  =  H(Jrid  Jr'v) 

-  A, A,. 

where  A/  =  zt  —  zi_\.  A  consistent  estimator  of  this  covariance  can  be  obtained  by  esti¬ 
mating  the  first  term  by  H{JT\),  where 


[  [{V~'(s))n  +  i(A,  +  A,)(V-i(s))i2  +  i  A,  A,  (V~'(s))22}ds, 

J  Tr  nrr/ 


■*5t 


dAj(s)  -|-  xT  dAiis) 


and  estimating  the  remaining  terms  using  the  following  estimator  (see  McKeague,  1988a) 
of  f!0(V-'(s))jkds: 


E  f<y~(s))n<y~(9))ki  dNi(s). 

t=i  Jt° 


A  chi-squared  statistic  for  testing  Hq  can  then  be  developed  as  in  Section  3. 


6.  Proofs 


We  shall  make  repeated  use  of  the  notation  (/?„,  n  >  1)  for  a  generic  sequence  of 
processes  which  converge  uniformly  in  probabilty  to  zero  as  n  — *•  oo.  Also,  the  processes 


a4b)(0  =  E/ 

i= 1 

z)  =  [2  f  ~ 7^—  tfM<">(«) I(x  €  Ir)dx, 

r=lJ 0  JO  Yf  '( S ) 


play  an  important  role  in  the  proofs. 

Proof  of  Theorem  3.1.  By  the  proof  of  Theorem  3.1  of  MU  we  have  (in  our  notation) 

y/n(A-  A)  =  M  +  Rn.  (6.1) 


The  Lipschitz  condition  on  A  required  for  that  theorem  to  be  applicable  is  satisfied  for 
Cox’s  proportional  hazards  model  (3.1),  since  Aq  is  assumed  to  be  Lipschitz. 


The  next  step  in  the  proof  is  to  decompose  yfn{A  —  A)  using  the  results  of  Andersen 
and  Gill  (1982),  subsequently  referred  to  as  AG.  Conditions  A  to  D  of  AG  can  be  checked 
under  conditions  of  the  present  theorem  (cf.  the  proof  of  Theorem  4.1  of  AG).  Then  write 

-  A)(t ,  z)  =  y/n(A(t)  -  A0 (<))  K*)  +  -  e0oZ)  dx  A .(<)•  (6-2) 

Jo 

By  AG  (p.1104) 

Vn(A(t)  -  A0(<))  =  M0(t)  -  \/n0  -  fi0)  f  e(/30,u)  A0(u)  du  +  Rn(t)  (6.3) 

Jo 


where 


Mo(t)  =  »"*  £  I*  s(6)-(^u)I{°  -  Zi{u)  ~  !)  (u)- 


Also,  since  is  a  consistent  estimator  of  and  A  is  uniformly  consistent  estimator  of  Ao, 
by  a  Taylor  series  expansion  the  second  term  in  (6.2)  can  be  written  as 


Vn0  -  0o)  Ao(t)  f  xe0oXdx  +  Rn(t,z). 
Jo 


By  AG  (proof  of  Theorem  3.2) 


y/^0  -  0O)  =  n-*U(0 o,  1)S_1  +  oP(  1). 


Note  that  by  (3.3)  and  (2.1) 


u(m  =  E  jf  {*,•(«)  -  %;-g}/(°  <  <  i)^.-(tt) 


Let  xr  =  r/rf„,  and  introduce  the  martingale 


=  £  f'{Z:rHZM  e  I.) -fp^h1  S  z«(«)  < 


By  Doob’s  inequality  and  (6.6) 


£  sup  17.-4(7(00,1)  -  ail)!1  <  4  E  /'{  V( i.  -  Z,(«))/(Z,(u)  e  I,)}1  d{M1)l 
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Thus,  combining  (6.1)-(6.5),  we  obtain  the  decomposition 

y/n(A  -  A)(t,z)  =  M(t,z )  -  b(z)M0(t)  -  c(t,z)A/i(l)  +  Rn(t,z).  (6.8) 
Set 

(6-9) 

m'W=L  1 


Then  m o  and  m\  are  independent  zero  mean  Gaussian  martingales  with  predictable  varia¬ 
tion  processes 


A0(u) 


du. 


v((3o,u)s(°\/3o,u)  Xo(u) du. 


Suppose  that  Mi)^(m,mo,  mi)  jointly  in  D’  =  D2  x  D[0,  l]2.  Define  a  map 

-»  D'  by  Tt'n(f\,h,h)  =  (jrn(/i),/2, /a),  where  jr„  is  defined  by  tt n{f)(t,x)  = 
/(t,a;r-i)  +  dn(z-ir_1)/(t,zr)  for  x  6  IT,  where  xr  =  r/dn.  Here  tt „(/)(t,  •)  is  a  piecewise 
linear  approximation  to  /(t,  •)  based  on  the  points  xr,  r  =  1, . . .  ,«f„,  for  each  t.  Note  that 
jyr(")  =  7r„(Af(”)).  Also,  appealing  to  a  D'  version  of  Lemma  4.1  of  McKeague  (1988b)  we 
have  Tr'n(M ,  Mo,  Mi)^(m,mo,mi)  in  D' ,  where  we  have  used  the  fact  that  m  (defined  by 
(2.3)),  m0  and  have  continuous  sample  paths.  Thus  (M,  M0,  M\  )-^(m,  m0,  my )  jointly 
in  D'  and  by  the  continuous  mapping  theorem  and  (6.8)  we  may  conclude  that 


y/n(A  -  v4)^ 


m  -  b mo  -  cm\(l)  =  m'. 


It  remains  to  show  that  (M,  Mo,  M\)^(m,  m0,  mj )  jointly  in  D'.  By  (6.7)  and  the 
proof  of  Theorem  3.4  of  AG  we  have  (Mo,Mi)-+(m0,  n?i)  in  £>[0,  l]2.  Also,  by  the  proof 

of  Theorem  3.1  of_MU^we_have  M— in  D 2-  If  we  can  show  that  the  finite  dimensional 
distributions  of  (M,M0,  M\)  converge  to  those  of  (m,m0,m  i)  then  we  are  finished.  It 
suffices  to  show  that  for  any  0  <  z0  <  Zj  <  ■  •  •  <  zq  <  1,  q  >  1, 

((A/(-,Zj)  -  M(-,Zj_i))j=i,  Mo(-),  M1(-))-E((m(-,zj)  -  m(-,zj_1))^=1,  m0(-),  mi(-)) 

in  Z?[0,  l]9+2.  This  is  done  using  Rebolledo’s  (1980)  martingale  central  limit  theorem.  Since 
Mr,  r  =  1  are  orthogonal  martingales  and  Afo  can  be  written  in  the  form 


I 


I 
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we  have  (cf.  Lemma  9  of  MU) 


_  ,  I*d-1  ,t  ,  , 

<-(,»>.  HOfc-jrE/ 

p  i‘  r  «***>(•)  v. 


JJ 


0  Jo 


didu  =  (m(-,z),  m0(-))t. 


Also,  directly  from  the  definitions  of  M  and  A/j 


<"(-)■  *<•»■  *  i  -  !»{*$} 

~J  Jo  {I'^t^}e&'Ao(u>''X‘'“  =  (m(''Z)'mi(')>‘- 


Now  apply  the  version  of  Rebolledo’s  central  limit  theorem  given  by  AG  (Theorem  1.2) 
with  p  =  q  +  2  and  dn  playing  the  rolejpf  n.  There  are  9  +  2  Lindeberg  conditions  to 
check.  For  the  q  components  involving  A/(-,z),  these  conditions  follow  from  Lemma  6  of 
MU.  The  same  approach  works  for  the  Mi  component,  and  the  Mo  component  is  treated 
in  AG  (proof  of  Theorem  3.4).  □ 


Prooj  of  Theorem  5.1.  First  note  that  by  the  proof  of  Theorem  3.2  of  McKeague  (1988a) 
we  can  decompose  y/n(A  -  A)  as 

\fn{A-  A){t,z)  =  zy/n(Ai  -  ^i)(0  +  ^2s/n{A2  -  M){t) 

=  zA/,(<)  +  \z2M2(t)  +  Rn(t,z),  (6.10) 

where 

^(()  =  ^E  /<G|;)(s)dA/l(s), 

V  n  fr'i  Jto 

G(n),s)  =  {  G(*\s)  if  \G^(s)\  <  C 
,J  l  C  otherwise, 

2 

<4n)(*)  =  Y,(V-\s))3kYxk(s)X-Hs), 

k= 1 

U(s)  =  (Ajfc(s)),  (a  2  X  2  matrix) 

i=i 

and  C  is  a  positive  constant  such  that 

P(G<;)(s)  =  (^(s)  for  alii  =  1 . n,  s  £  [<o,l])  -  1-  (6.11) 
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By  (3.1)  and  (6.10)  we  obtain  the  following  decomposition 


y/n(A  -  A)(t,z)  =  M(t,z)  -  zM\(t)  -  \z'1M2(t)  +  Rn(t,z).  (6.12) 


Set 


l(V-1(*))ji+*(V'I(*))j*] 

>/h(s,x) 


d\V($,x), 


t0<t<l 


for  j  =  1,2.  Then  (mi, m2)  is  a  bivariate  Gaussian  martingale  with  zero  mean  and,  as 
routine  calculations  show,  predictable  covariation  processes 


( mj,mk)t=  f  (V_1(s))j*ds,  j,k  =  1,2. 

Jt0 

By  the  proof  of  Theorem  3.2  of  McKeague  (1988a)  it  follows  that  (Mi ,  M2)^(mi ,  m2) 
in  D[to»l]2-  Also,  by  the  proof  of  Theorem  3.1  of  MU,  we  have  in  D 2,  where  m 

is  defined  in  the  statement  of  Proposition  2.1.  Thus,  from  the  representation  (6.12),  to 
complete  the  proof  (cf.  the  proof  of  Theorem  3.1)  it  suffices  to  show  that  for  any  0  <  zo  < 
zi  <  •••  <  zq  <  1,  q  >  1, 


((Mhz>)-M(.,z;_1))J=1,M,(0,M2O)^M^>)-^(-J^-i))j=1^i(-),m2(.)) 

in  D[to,  1],+2.  As  in  the  proof  of  Theorem  3.1,  we  apply  the  version  of  Rebolledo’s  mar¬ 
tingale  central  limit  theorem  given  in  AG  (Theorem  1.2)  to  do  this.  Note  that  Mj,  j  -  1,2 
are  square  integrable  martingales  and 


■£  /  /  [(^  1(*))ji  +  i(Vr~1(s))i2]dsdi  =  (m(-,z),mj(-))u 
Jo  Jt0 


where  we  have  used  (6.11),  Lemma  4.3  of  McKeague  (1988a)  and  the  fact  that  |z-zr|  <  d~l , 
when  z  €  Ir-  Once  again  there  are  q  +  2  Lindeberg  conditions  to  check.  They  have  been 
checked  for  Mj,  j  =  1,2  in  the  proof  of  Theorem  3.2  of  McKeague  (1988a),  and  for  the  q 
components  involving  M(-,z)  in  MU  (Lemma  6).  □ 
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